# SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Jean-Marc Roch
Victoria E.S. Scott
Kristi L. Anderson
James P. Sullivan

(ii) TITLE OF INVENTION: HUMAN ENDOSULFINE GENE

(iii) NUMBER OF SEQUENCES: 13

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Abbott Laboratories
(B) STREET: 100 Abbott Park Road
(C) CITY: Abbott Park
(D) STATE: Illinois
(E) COUNTRY: USA
(F) ZIP: 60064-3500

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: Macintosh
(C) OPERATING SYSTEM: System 7.0.1
(D) SOFTWARE: MS Word

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER:
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Priscilla E. Porembski
(B) REGISTRATION NUMBER: 33,207
(C) DOCKET NUMBER: 6024.US.01

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 708/937-0378
(B) TELEFAX: 708/938-2623
(C) TELEX:



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1093 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

GTGACAGGAG CCGAAGCAGC AGCGCAGGTT GTCCCCGTTT CCCCTCCCCC TTCCCTTCTC 60
CGGTTGCCTT CCCGGGCCCC TTACACTCCA CAGTCCCGGT CCCGCCATGT CCCAGAAACA 120
AGAAGAAGAG AACCCTGCGG AGGAGACCGG CGAGGAGAAG CAGGACACGC AGGAGAAAGA 180
AGGTATTCTG CCTGAGAGAG CTGAAGAGGC AAAGCTAAAG GCCAAATACC CAAGCCTAGG 240
ACAAAAGCCT GGAGGCTCCG ACTTCCTCAT GAAGAGACTC CAGAAAGGGC AAAAGTACTT 300
TGACTCAGGA GACTACAACA TGGCCAAAGC CAAGATGAAG AATAAGCAGC TGCCAAGTGC 360
AGGACCAGAC AAGAACCTGG TGACTGGTGA TCACATCCCC ACCCCACAGG ATCTGCCCCA 420
GAGAAAGTCC TCGCTCGTCA CCAGCAAGCT TGCGGGGTAA CCTGAGCCCC CCTCTCCTCC 480
CCTTCCTCAA CCACTGGACG TTTATATATT ATAGGCAGGG ATGAAATGGG CACCTAGTCA 540
GATCTTCTCA GCTTGCTAGC CAGAAATGAC TGTGATTCTG CTGGGGGCTG CTGAGAAGGT 600
AATGTAGGTT GAAAAGGGGC TCTAAGTTTA TTTATTTCGT TAGATTGACA CTTCCACACA 660
CTCCCTGTAG TCCAGGTAGG GCCTAGAAAT AGGAAAGGCT AGGATTGGAT AATGCTGCAA 720
ATGCTTTTTT TGTGTGAGAA ACTGGAGAGA TGTGATTTCT CCTTTTGGGA GAGAATGTCC 780
CAAAATTGAT TAGGCTGAGC CTTGGGAATA GTTTGGCAGG TTTAACATCC CAAGGCTAAC 840
CTAACGTAGT TGGGAAAGGT AGATTGAATG AGACATGTTT TCTGTGCTTC TAAGTGTTCT 900
GTCCCTTAGG CTGCTATTGC TTCATGTTTC CATTATGGCA GGTTTAGAGA ATCCTTAAAA 960
AGAAAAATTG ACTTGCTTGC CTAAAACTAC AGTGCCCCCT TAGCCTCCAT TACTTAGTAT 1020
CTCTTACAGT TTGCTCTGGC TCTCAAATAA TATAAAGATT GATGAACATT ATTCACAAAA 1080
AAAAAAAAAA AAAGGGCGGC GCG 1103

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1199 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

GTGACAGGAG CCGAAGCAGC AGCGCAGGTT GTCCCCGTTT CCCCTCCCCC TTCCCTTCTC 60
CGGTTGCCTT CCCGGGCCCC TTACACTCCA CAGTCCCGGT CCCGCCATGT CCCAGAAACA 120
AGAAGAAGAG AACCCTGCGG AGGAGACCGG CGAGGAGAAG CAGGACACGC AGGAGAAAGA 180
AGGTATTCTG CCTGAGAGAG CTGAAGAGGC AAAGCTAAAG GCCAAATACC CAAGCCTAGG 240
ACAAAAGCCT GGAGGCTCCG ACTTCCTCAT GAAGAGACTC CAGAAAGGGC AAAAGTACTT 300
TGACTCAGGA GACTACAACA TGGCCAAAGC CAAGATGAAG AATAAGCAGC TGCCAAGTGC 360
AGGACCAGAC AAGAACCTGG TGACTGGTGA TCACATCCCC ACCCCACAGG ATCTGCCCCA 420
GAGAAAGTCC TCGCTCGTCA CCAGCAAGCT TGCGGGTGGC CAAGTTGAAT GATGCTGCCC 480
GGGGCTCTGC CAGATCCTGA GACTGCTTTT GCCGCTTCCC CTCCCTGCCC CACCCGGGTC 540
CTGTGCTGGC TCCTGCCCCT TCAGCCAGGG GTCAGGAGGT GGCTCGGGTG TGGGCTGGAG 600
AGGCAGAAGC CCTTTCCTGT TGGTGTCCCA GCACATGGAG CCCCTTGGGC TGAGCACCAA 660
GACCTTGAAC CTTTTTTGTT TTACCTTTTT TCCAAATAAC AGTTGGGAGA AATATCAATG 720
AAATTCTGGG GGTGGGGGTG GGGCTGAAAG GGTGGGGTGG GAGATATGGA GGAGTATGAA 780
TTAGGGCTTG GAGTTGGTAA AAACATTCCT GACTATCCTT CTTAACCACG TGGCTGATGT 840
GGGGTAGGTA TGAGGGGAAG GAAGTGGAGT AGCCTAATGA AAAGGGGTTC TAGTTGAGCT 900
CTGTAGATAA ATGCCTTGTT TCAGTGTGGT TGGAGACCTG GTGTCAGATA AAAGAAACTC 960
CATCCGCACA GACAGATGCA AACAGCTCCT CTAGTTCTGC AGAGCTAGTT GAGAACTCAA 1020
CATTAATCAT TTTAAAAAGT ACTGTCCTTG AAATAGATTT GCTGTGGGAA GAAGGGCAGT 1080
GAGTGTGGGA GAAAGGAGCC GTGAGCGTGG GGAACCCCAC AGAGCCCAAA GGACTTTTTC 1140
AGTATTCGAA ATAAACAAAA CAAAAACCCA TGAAAAAACC CAAAAAAAAA AAAAAAAAA 1199

(2) INFORMATION FOR SEQ ID NO:3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 117 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:

Met Ser Gln Lys Gln Glu Glu Glu Asn Pro Ala Glu Glu Thr Gly Glu
1               5                   10                  15

Glu Lys Gln Asp Thr Gln Glu Lys Glu Gly Ile Leu Pro Glu Arg Ala
            20                  25                  30

Glu Glu Ala Lys Leu Lys Ala Lys Tyr Pro Ser Leu Gly Gln Lys Pro
        35                  40                  45

Gly Gly Ser Asp Phe Leu Met Lys Arg Leu Gln Lys Gly Gln Lys Tyr
    50                  55                  60

Phe Asp Ser Gly Asp Tyr Asn Met Ala Lys Ala Lys Met Lys Asn Lys
65                  70                  75                  80

Gln Leu Pro Ser Ala Gly Pro Asp Lys Asn Leu Val Thr Gly Asp His
                85                  90                  95

Ile Pro Thr Pro Gln Asp Leu Pro Gln Arg Lys Ser Ser Leu Val Thr
            100                 105                 110

Ser Lys Leu Ala Gly
        115



(2) INFORMATION FOR SEQ ID NO:4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 121 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:

Met Ser Gln Lys Gln Glu Glu Glu Asn Pro Ala Glu Glu Thr Gly Glu
1               5                   10                  15

Glu Lys Gln Asp Thr Gln Glu Lys Glu Gly Ile Leu Pro Glu Arg Ala
            20                  25                  30

Glu Glu Ala Lys Leu Lys Ala Lys Tyr Pro Ser Leu Gly Gln Lys Pro
        35                  40                  45

Gly Gly Ser Asp Phe Leu Met Lys Arg Leu Gln Lys Gly Gln Lys Tyr
    50                  55                  60

Phe Asp Ser Gly Asp Tyr Asn Met Ala Lys Ala Lys Met Lys Asn Lys
65                  70                  75                  80

Gln Leu Pro Ser Ala Gly Pro Asp Lys Asn Leu Val Thr Gly Asp His
                85                  90                  95

Ile Pro Thr Pro Gln Asp Leu Pro Gln Arg Lys Ser Ser Leu Val Thr
            100                 105                 110

Ser Lys Leu Ala Gly Gly Gln Val Glu
        115                 120



(2) INFORMATION FOR SEQ ID NO:5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:

GAAGAGACTC CAGAAAGGGC 20



(2) INFORMATION FOR SEQ ID NO:6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 20 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:

GAAGCAATAG CAGCCTAAGG 20



(2) INFORMATION FOR SEQ ID NO:7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 19 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

TGTGCTGGGA CACCAACAG 19



(2) INFORMATION FOR SEQ ID NO:8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

GGAAGGCGCC ATGTCCCAGA AACAAGAAG 29



(2) INFORMATION FOR SEQ ID NO:9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:

CAAAGTCGAC TTACCCCGCA AGCTTGCTG 29



(2) INFORMATION FOR SEQ ID NO:10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:

CCAAGTCGAC TCATTCAACT TGGCCACCC 29

(2) INFORMATION FOR SEQ ID NO:11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 111 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:

Met Ser Ala Glu Val Pro Glu Ala Ala Ser Ala Glu Glu Gln Lys Glu
1               5                   10                  15

Met Glu Asp Lys Val Thr Pro Glu Lys Ala Glu Glu Ala Lys Leu Lys
            20                  25                  30

Ala Arg Tyr Pro His Leu Gly Gln Lys Pro Gly Gly Ser Asp Phe Leu
        35                  40                  45

Arg Lys Arg Leu Gln Lys Gly Gln Lys Tyr Phe Asp Ser Gly Asp Tyr
    50                  55                  60

Asn Met Ala Lys Ala Lys Met Lys Asn Lys Gln Leu Pro Thr Ala Thr
65                  70                  75                  80

Pro Asp Lys Thr Glu Val Thr Gly Asp His Ile Pro Thr Pro Gln Asp
                85                  90                  95

Leu Pro Gln Arg Lys Pro Ser Leu Val Arg Ser Lys Leu Ala Gly
            100                 105                 110



(2) INFORMATION FOR SEQ ID NO:12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 77 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:

Glu Gly Ile Leu Pro Glu Arg Ala Glu Glu Ala Lys Leu Lys Ala Lys
1               5                   10                  15

Tyr Pro Ser Leu Gly Gln Lys Pro Gly Gly Ser Asp Phe Leu Met Lys
            20                  25                  30

Arg Leu Gln Lys Gly Gln Lys Tyr Phe Asp Ser Gly Asp Tyr Asn Met
        35                  40                  45

Ala Lys Ala Lys Met Lys Asn Lys Gln Leu Pro Ser Ala Gly Pro Asp
    50                  55                  60

Lys Asn Leu Val Thr Gly Asp His Ile Pro Thr Pro Gln
65                  70                  75













(2) INFORMATION FOR SEQ ID NO:13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 82 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:

Glu Gly Ile Leu Pro Glu Arg Ala Glu Glu Ala Lys Xaa Xaa Ala Lys
1               5                   10                  15

Tyr Pro Ser Leu Gly Gln Lys Pro Gly Gly Ser Asp Phe Leu Met Lys
            20                  25                  30

Arg Xaa Xaa Xaa Xaa Gln Lys Tyr Phe Asp Ser Gly Asp Tyr Asn Met
        35                  40                  45

Ala Lys Xaa Xaa Xaa Xaa Xaa Xaa Gln Leu Pro Ser Ala Gly Pro Asp
    50                  55                  60

Lys Asn Leu Val Thr Gly Asp His Ile Pro Thr Pro Gln Asp Leu Pro
65                  70                  75                  80

Gln Arg



